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1  Introduction 


It  is  well-known  that  optimality  conditions  are  central  to  both  theoretical  and  computational  advances 
in  optimization.  They  were  developed  over  centuries  starting  with  the  pioneering  works  of  Bishop 
N.  Oresrne  (14th  century)  and  P.  de  Fermat  (17th  century),  and  brought  to  their  modern  form  by 
Karush,  John,  Kuhn,  Tucker,  Polak,  Mangasarian,  Fromowitz,  and  many  others.  In  this  paper,  we 
discuss  quantification  of  first-order  necessary  optimality  conditions  in  terms  of  optimality  functions 
as  developed  by  E.  Polak  and  co-authors;  see  [12]  for  numerous  examples  in  nonlinear  programming, 
semiinfinte  optimization,  and  optimal  control  as  well  as  [14,  17,  5,  9]  for  recent  applications  in  stochastic 
and  semiinfinte  programming,  nonsmooth  optimization,  and  control  of  uncertain  systems. 

It  is  apparent  that  how  “far”  a  set  of  equalities,  inequalities,  and  inclusions  are  from  being  satisfied 
can  be  quantified  in  numerous  ways.  The  framework  of  optimality  functions,  as  laid  out  in  [12,  Section 
3.3]  and  references  therein,  stipulates  axiomatic  requirements  that  such  quantifications  should  satisfy  to 
facilitate  the  study  and  computation  of  approximate  stationary  points.  Specifically,  for  an  optimization 
problem  that  can  only  be  “solved”  through  the  solution  of  an  approximating  problem,  one  seeks  to 
determine  whether  a  near-stationary  point  of  the  approximating  problem  is  an  approximate  stationary 
point  of  the  original  problem.  The  requirements  on  optimality  functions  exactly  ensure  this  property. 
Moreover,  there  is  ample  empirical  indications  and  some  theoretical  evidence  (see  for  example  [17,  16, 
15,  8])  that  computational  benefits  accrue  from  approximately  solving  a  sequence  of  approximating 
problems  with  increasing  fidelity,  each  warm-started  with  the  previously  obtained  point.  Optimality 
functions  are  tools  to  carry  out  such  a  scheme  and  give  rise  to  adaptive  rules  for  determining  the  timing 
of  switches  to  higher-fidelity  approximations.  Consequently,  the  framework  of  optimality  functions 
provides  a  pathway  to  constructing  implementable  algorithms  consisting  only  of  a  finite  number  of 
arithmetic  operations  and  function  evaluations^. 

In  this  paper,  we  review  the  notion  of  optimality  functions  and  illustrate  the  vast  number  of  pos¬ 
sibilities  through  several  examples.  In  a  novel  application  to  nonlinear  programming,  we  establish  the 
convergence  of  a  primal  interior  point  method  in  the  absence  of  constraint  qualifications  and  convexity 
assumptions.  We  show  that  lopsided  convergence  of  bifunctions  [6,  7]  is  a  useful  tool  for  analyzing  opti¬ 
mality  functions  and  the  associated  stationary  points.  In  particular,  we  prove  that  lopsided  convergence 
of  certain  bifunctions,  defining  optimality  functions  of  approximating  problems,  to  a  bifunction  asso¬ 
ciated  with  an  optimality  function  of  the  original  problem,  guarantees  the  axiomatic  requirements  on 
optimality  functions.  Moreover,  we  provide  characterizations  of  stationary  points  under  perturbations 
and  approximations  using  lopsided  convergence.  In  the  process,  we  extend  the  primary  definitions  and 
results  on  lopsided  convergence  in  [6,  7]  from  finite  dimensions  to  any  metric  space. 

The  paper  is  organized  as  follows.  Section  2  defines  optimality  functions  and  gives  several  ex¬ 
amples.  Section  3  introduces  approximating  optimization  problems,  epi-convergence,  and  consistent 
approximations  as  defined  by  corresponding  optimality  functions,  and  demonstrates  the  implication  for 
algorithmic  development.  Section  4  develops  lopsided  convergence  in  the  context  of  metric  spaces.  The 
paper  ends  by  utilizing  lopsided  convergence  in  the  context  of  optimality  functions. 

^The  distinction  between  implementable  and  conceptual  algorithms  appears  to  be  due  to  E.  Polak  [11,  10]. 
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2  Optimality  Functions:  Definitions  and  Examples 

We  consider  optimization  problems  defined  on  a  metric  space  (X,px),  where  C  C  X  is  a  nonempty 
feasible  set  and  /  :  C  — >  1R  an  objective  function,  i.e.,  problems  of  the  form 

minimize  f(x )  subject  to  x  €  C  C  X. 

The  function  /  might  be  defined  and  finite-valued  outside  C.  but  that  will  be  immaterial  to  the  following 
treatment.  Therefore,  the  notation  f  :  C  M  specifies  the  components  /  and  C  of  optimization 
problems  of  this  form,  without  implying  that  /  is  necessarily  finite  only  on  C. 

We  denote  by  infc  /  £  [—00, 00)  and  argminc.  /  C  C  the  corresponding  optimal  value  and  set  of 
optimal  points,  respectively,  the  latter  possibly  being  empty.  For  e  >  0,  the  set  of  e-optimal  solutions 
is  denoted  by 

e-  argminc  f  =  {x  £  C  \  f(x)  <  infc  /  +  e}  • 

As  usually,  we  say  that  x*  €  X  is  locally  optimal  (for  /  :  C  — >•  1R )  if  there  exists  a  5  >  0  such  that 
f(x*)  <  f(x )  for  all  x  £  C  with  px{x,  x*)  <  5. 

Throughout  the  paper,  we  have  that  C  is  a  nonempty  subset  of  X  and  M-  =  [— 00, 0].  We  charac¬ 
terize  stationary  points  in  terms  of  optimality  functions  as  defined  next. 

2.1  Definition  (optimality  function)  An  upper  semicontinuous  function  9  :  X  — >•  M-  is  an  optimality 
function  for  f  :  C  1R  if  C  C  X  C  X  and 

x  €  C  locally  optimal  for  f  :  C  — >■  1R  =>-  6{x)  =  0. 

The  corresponding  sets  of  stationary  points  and  quasi-stationary  points  are  Sc, 8  =  {x  €  C  \  9(x)  =  0} 
and  Qo  =  {x  £  X  \  9(x)  =  0},  respectively. 

A  series  of  examples  help  illustrate  the  concept;  see  also  §5  and  [12,  14,  17,  5,  9]. 

Example  1:  Constrained  Optimization  over  Convex  Set.  Consider  the  case  X  = 
closed  and  convex,  and  f  :  C  M  continuously  differentiable.  Then,  the  function 

0 (x)  =  mm  j  {Vf(x),y-x)  +  ^||y-.'c||2|  ,  x  £  X  =  C, 

satisfies  the  requirements  of  Definition  2.1  and  is  therefore  an  optimality  function  for  / 

C  =  ]Rn,  then  the  expression  simplifies  to 

6(x)  =  -1-\\Xf(x)\\2, 

which,  of  course,  corresponds  to  the  classical  condition  V/(x)  =  0. 

Example  2:  Nonlinear  Programming.  Consider  the  case  X  =  lRn .  C  =  {x  £  lRn  \  fj(x)  <  0 ,j  = 
1, ...,  q},  and  /,  f\,  ...,  fq  real-valued  and  continuously  differentiable  on  Mn.  Let  ip{x)  =  maxj=ir..)(?  fj(x) 


lR  n ,  C  C  X 


:  C  —>  JR.  If 


(1) 
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and  ip+(x)  =  max{0, ip(x)}.  Then,  the  function 

0{x)  =  min  max  j  - ^+(x)  +  (Vf(x),y-  x)  +  -|| y-  x||2, 
y&R  [  2 

max  {fj(x)  -  i/)(x)+  +  (Vfj(x),y~  x)}  +  h\y  —  x\\2  \  ,  x  G  X  =  M'\  (2) 

J  =  l,...,q  2  ) 

satisfies  the  requirements  of  Definition  2.1  and  is  therefore  an  optimality  function  for  /  :  C  JR.  The 
condition  6{x)  =  0  is  equivalent  to  the  Fritz-John  conditions  in  the  sense  that  when  x  G  C, 

q 

9(x)  =  0  there  exist  n o,Hi,  . . . ,  jiq  >  0,  with  E  N  = 

j= o 

<? 

such  that  MoV/(x')  +  E  HjVfjix)  =  0 
i= i 
q 

=  °- 

3= i 

However,  since  9  is  dehned  beyond  C.  it  might  also  be  associated  with  quasi-stationary  points  outside 
C.  We  refer  to  [12,  Theorem  2.2.8]  for  proofs  and  further  discussion. 

Example  3:  Minimax  Problem.  Consider  the  case  X  =  C  =  Mn  and  f(x)  =  rna  xzez  <f(x,  z), 
x  €  JRn,  where  (p  :  lRn  x  1RP  —$■  M  is  continuous,  the  gradient  :  Mn  x  Mp  — »  lRn  with  respect  to 
the  first  argument  exists  and  is  continuous  in  both  arguments,  and  Z  is  a  compact  subset  of  ]RP.  Then, 
the  function 

9{x)  =  min  max  <  <p(x,  z)  —  f(x)  +  (’ Vx<p(x ,  z),  y  —  x)  +  -\\y  —  x||2  1  ,  x  €  X  =  Mn,  (3) 
y£Rn  z€Z  [  2  J 

satisfies  the  requirements  of  Definition  2.1  and  is  therefore  an  optimality  function  for  /  :  Mn  — >•  ]R. 
Moreover,  6{x)  =  0  if  and  only  if  0  G  df(x )  (the  subdifferential  of  /);  see  [12,  Theorem  3.1.6]  for  details. 

We  note  that  the  upper  semicontinuity  of  optimality  functions  ensures  the  computationally  signifi¬ 
cant  property  that  if  a  sequence  xu  x  and  9(xu)Z  0,  for  example  with  {. xv }  obtained  as  approximate 
solutions  of  a  corresponding  optimization  problem  with  gradually  smaller  tolerance,  then  6(x)  =  0  and 
x  G  Qg,  i.e.,  x  is  quasi-stationary.  Although  not  discussed  further  here,  the  optimality  functions  in 
Examples  1-3,  and  others,  are  also  instrumental  in  constructing  descent  directions  for  the  respective 
optimization  problems;  see  [12]  for  details. 

3  Approximations  and  Implementable  Algorithms 

Problems  involving  functions  defined  in  terms  of  integrals  or  optimization  problems  (as  the  maxi¬ 
mization  in  Example  3),  functions  defined  on  infinite-dimensional  spaces,  and/or  feasible  sets  defined 
by  an  infinite  number  of  constraints  almost  always  require  approximations.  For  example,  one  might 
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resort  to  an  approximating  space  Xv  C  X  with  points  characterized  by  a  finite  number  of  parame¬ 
ters.  Here,  the  superscript  v  indicates  that  we  might  consider  a  family  of  such  approximating  spaces, 
v  G  JV  =  {1,  2, ..., },  with  usually  UI/£]yXl/  dense  in  X .  A  feasible  set  Cv  C  Xu  may  be  an  approxima¬ 
tion  of  C  or  simply  Cv  =  C  n  Xv;  see  §5  for  a  concrete  illustration  in  the  area  of  optimal  control.  A 
function  fv  :  Cv  — >•  M  could  be  a  tractable  approximation  of  /  :  C  — >•  1R.  An  example  helps  illustrate 
the  situation. 

Example  3:  Minimax  Problem  (cont.).  Suppose  that  fu(x)  =  max2g zv(p(x,z),  x  G  Mn,  with 
Zv  C  Z  consisting  of  a  finite  number  of  points.  Clearly,  fv  is  a  (lower  bounding)  approximation  of 
/  =  ma y-z^z  </?(•,  z)  as  defined  above.  The  function  fv  :  Mn  — >  M  can  be  associated  with  the  optimality 
function 

9u(x)  =  min  max  { ip(x,  z )  —  fu(x)  +  (Vx<p(x,  z),  y  —  x)  +  - \\y  —  x\\2  1  ,  x  €  X  =  lRn, 
yeRn  zeZ1'  (  2  J 

which,  as  formalized  in  §5,  approximates  the  optimality  function  8  in  (3).  We  note  that  8U  can  be  evalu¬ 
ated  in  finite  time  by  solving  a  convex  quadratic  program  with  linear  constraints;  see  [12,  Theorem  2.1.6]. 

We  next  examine  approximating  functions  fu  :  Cu  — >•  1R  and  review  the  notion  of  epi-convergence, 
which  provides  a  path  to  establishing  that  optimal  points  of  the  corresponding  approximating  problems 
indeed  approximate  optimal  points  of  an  original  problem.  To  establish  the  analogous  results  for 
stationary  points,  we  turn  to  optimality  functions  and  slightly  extend  the  approach  in  [12,  Section  3.3] 
by  considering  arbitrary  metric  spaces  and  other  minor  generalizations.  The  section  ends  with  a  result 
that  facilitates  the  development  of  implementable  algorithms  for  the  minimization  of  /  :  C  —>■  M,  which 
is  then  illustrated  with  the  construction  of  an  interior  point  method.  Throughout  the  paper,  we  have 
that  Cv  is  a  nonempty  subset  of  X. 

3.1  Epi-Convergence 

We  recall  that  epi-convergence,  as  defined  next,  is  the  key  property  when  examining  approximations  of 
optimization  problems;  see  [1,  3,  13]  for  more  comprehensive  treatments. 

3.1  Definition  (epi-convergence)  The  functions  {fu  :  Cu  — >•  ]R]V^  epi-converge  to  f  :  C  — >•  M  if 

(i)  for  every  sequence  xv  — >  x  G  X,  with  xv  G  Cu ,  we  have  that  liminf  /"( xv)  >  f(x)  if  x  G  C  and 
fv{xu)  — >•  oo  otherwise; 

(ii)  for  every  x  G  C,  there  exists  a  sequence  {xu}uej^,  with  xu  G  Cv ,  such  that  xu  — >•  x  and 
lirnsup  /"(x^)  <  f(x). 

A  main  consequence  of  epi-convergence  is  the  following  well-known  result. 

3.2  Theorem  (convergence  of  minimizers)  Suppose  that  the  functions  {  fv  :  Cv  — >•  epi-converge 

to  f:C->]R.  Then, 

lirnsup  (info  f)  <  hifc  /. 

Moreover,  if  xk  G  argnraxCi/fc  fVk  and  xk  — >•  x  for  some  increasing  subsequence  {v i,zz2,...}  C  JV,  then 
x  G  argmaxc  /  and  limfc^oo  infc^  fVk  =  inf'c  /• 


5 


Proof.  The  second  part  is  essentially  in  [2,  Theorem  2.5],  but  not  for  the  finite- valued  setting.  The  first 
and  second  parts  are  in  [6,  Theorem  2.6]  for  the  Mn  case.  The  proof  carries  over  essentially  verbatim. 

□ 

A  strengthening  of  the  notion  of  epi-convergence  ensures  the  convergence  of  infima. 

3.3  Definition  (tight  epi-convergence)  The  functions  {fu  :  Cv  -A  1R}u&n  epi-converge  tightly  to 
f  :  C  -A  1R  if  fv  epi-converge  to  f  and  for  all  e  >  0,  there  exists  a  compact  set  Bs  C  X  and  an  integer 
ve  such  that 

infsen c-  fu  <  inf cv  fv  +  £  for  all  u>u£. 

3.4  Theorem  (convergence  of  infima)  Suppose  that  the  functions  {fu  :  Cv  — >  fR}U£N  epi-converge  to 
f  :  C  M  and  infc  /  is  finite.  Then,  they  epi-converge  tightly 

(i)  if  and  only  if  inf'c^  fv  — >  inf'c  /. 

(ii)  if  and  only  if  there  exists  a  sequence  such  that  ev- argmin  fv  set-converged  to  argmin  /. 

Cv  C 

Proof.  Again,  the  proof  in  [6,  Theorem  2.8]  can  be  immediately  translated  to  the  present  setting.  □ 

3.2  Consistent  Approximations 

The  convergence  of  optimal  points  as  stipulated  above  is  fundamental,  but  an  analogous  result  for 
stationary  points  is  also  important,  especially  for  nonconvex  problems.  Optimality  functions  play  a 
central  role  in  the  development  of  such  results.  Combining  epi-convergence  with  a  limiting  property 
for  optimality  functions  lead  to  consistent  approximations  in  the  sense  of  E.  Polak  as  defined  next.  We 
note  that  our  definition  of  consistent  approximations  is  an  extension  from  that  in  [12,  Section  3.3]  as 
we  consider  arbitrary  metric  spaces  and  not  only  normed  linear  spaces. 

3.5  Definition  (consistent  approximations)  The  pairs  {( fv  :  Cv  -A  M,  6V  :  Xv  — »•  fR-)}u&jv  of 
functions  and  corresponding  optimality  functions  are  weakly  consistent  approximations  of  the  function 
and  optimality- function  pair  (/  :  C  — »  1R,  9  :  X  — >•  M-)  if 

(i)  {fu  :  Cu  — >  ]R}U£]v  epi-converge  to  f  :  C  M  and 

(ii)  for  every  x1'  — >•  x  €  X,  with  xu  €  Xv ,  limsup  9u(xu)  <  0(x)  if  x  €  X,  and  9v(xv)  — >  —  oo  otherwise. 

If  in  addition  9v(x)  <  0  for  all  x  €  XU\CU  and  u,  then  {(fu  :  Cu  — >  1R,  6U  :  Xu  — >•  are 

consistent  approximations  of  (/  :  C  1R,  6  :  X  — >•  M-). 

We  recall  that  the  epigraph  of  /  :  C  — >  1R  is  defined  by 

epi  /  =  {(x,  xo)  G  X  x  IR  \  x  G  C,f(x)  <  xo}. 

^We  recall  that  the  outer  limit  of  a  sequence  of  sets  {Al'}„£ jv,  denoted  by  limsup  A" ,  is  the  collection  of  points  y 
to  which  a  subsequence  of  {y1'} i/g/v,  with  yv  £  A",  converges.  The  inner  limit,  denoted  by  liminf  A?,  is  the  points  to 
which  a  sequence  of  {yI'}I,Siv,  with  yv  £  A1' ,  converges.  If  both  limits  exist  and  are  identical,  we  say  that  the  set  is  the 
Painleve-Kuratowski  limit  of  {A1' jvgiv  and  that  A1'  set-converges  to  this  set;  see  [4,  13]. 
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Since  epi-convergence  is  equivalent  to  the  set-convergence^  of  the  corresponding  epigraphs,  we  have  that 
Definition  3.5(i)  amounts  to 

epi  fv  set-converges  to  /. 

Similarly,  the  hypograph  of  /  :  C  — >  M  is  defined  by 

hypo  f  =  {(x,  xq)  £XxM\x£C,  f(x)  >  xq\. 

In  view  of  the  definition  of  set-convergence,  we  therefore  have  that  Definition  3.5(h)  amounts  to 

lirnsup hypo 9V  C  hypo#. 

The  additional  condition  in  Definition  3.5  removing  “weakly”  can  be  viewed  as  a  constraint  qual¬ 
ification  as  it  eliminates  the  possibility  of  quasi-stationary  points  that  are  not  stationary  point  for 
fv  :  Cv  — >  M,  which  might  occur  if  the  domain  of  6U  is  not  restricted  to  Cu  or  other  conditions  are 
included. 

The  main  consequence  of  consistency  is  given  next. 

3.6  Theorem  (convergence  of  stationary  points)  Suppose  that  the  pairs  {( fv  :  Cv  IR,  9U  :  Xu  — > 
1R-)}v&n  are  weakly  consistent  approximations  of  (/  :  C  — >•  1R,  6  :  X  — >  J?„)  and  {xu}u£n,  x"  €  Xv , 
is  a  sequence  satisfying 

9v{xv)  >  —ev  for  all  u,  with  ev  >  0  and  e1'  — >•  0. 

Then,  every  cluster  point  x  of  {xu}u£n  satisfies  x  €  Qg,  i.e.,  9(x)  =  0. 

If  in  addition  the  pairs  are  consistent  approximations,  £u  =  0  for  sufficiently  large  u,  and  {fv  {xu)}u&iv 
is  bounded  from  above,  then  x  €  C,  i.e.,  x  €  Sc.o ■ 

Proof.  Suppose  that  xu  — >•  x.  Since  —£u  <  9u(xu)  for  all  u,  x  €  X.  Moreover,  0  <  limsup  9u(xu)  < 
9{x)  <  0  and  the  first  conclusion  follows.  In  view  of  the  definition  of  consistent  approximations,  we 
find  that  9u(xu)  =  0  for  sufficiently  large  v  and  therefore  xv  €  Cu  for  such  za  The  epi-convergence  of 
fv  :  Cu  — >  ]R  to  /  :  C  — >  1R  implies  that  liminf  fu(xv)  >  f{x)  if  x  G  C  and  fu(xu)  — >  oo  if  x  0  C.  The 
latter  possibility  is  ruled  about  by  assumption  and  therefore  x  €  C.  □ 

3.3  Algorithms 

Theorem  3.6  provides  a  direct  path  to  the  construction  of  an  implementable  algorithm  for  minimizing 
/  :  C  -»  M.  Specifically,  construct  a  family  of  approximations  {fu  :  Cv  — >  1R}  and  corresponding 
optimality  functions  {9U  :  Xu  — >•  M-},  and  then  implement  the  following  algorithm. 

Algorithm. 

1.  Set  £v  >  0,  with  £u  0,  and  v  =  1. 

2.  Obtain  an  approximate  (quasi-)stationary  point  xv  for  fu:Cu-^-lR  that  satisfies  9u(xu)  >  —  £u. 

§Hcre,  we  consider  set-convergence  of  subsets  of  X  x  R,  which  is  equipped  with  the  metric  p((x,x o),(x',x'o))  = 
max{p^(x,  *'),  |*o  —  *o|}  for  *,*'  £  X  and  Xq,x'0  €  M. 
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3.  Replace  v  by  v  +  1  and  go  to  Step  2. 


If  the  pairs  {(/"  :  Cu  ^  M,  0U  :  are  weakly  consistent  approximations  of  (/  :  C  — » 

1?,  9  :  X  ^  J?_),  then  every  cluster  point  of  the  constructed  sequence  {x^}  will  be  quasi-stationary 
for  f  :  C  —)■  M  by  Theorem  3.6.  The  algorithm  is  fully  implementable  under  the  practically  reasonable 
assumption  that  one  can  obtain  an  approximate  quasi-stationary  point  of  fv  :  Cv  — >•  ]R  in  finite  time. 


Example  2:  Nonlinear  Programming  (cont.).  Consider  the  standard  logarithmic  barrier  approx¬ 
imation 

q 

.TO)  =  / 0)  -  x  <E  (T  =  {x  (E  |  fj(x )  <  0 ,j  =  1, 

3= 1 

where  i"\0.  We  first  establish  epi-convergence  of  fv  :  (7"  — >•  to  /  :  C  — >•  .ff?.  Suppose  that  x*'  — >•  x, 
with  xy  €  C1' .  Since  Cu  <Z  C  and  C  is  closed,  x  €  C.  Let  e  >  0.  There  exists  a  ve  such  that 
— tv  log[— >  —e/q  for  all  j  with  log[— fj(x)]  >  0  and  v  >  ve.  Hence, 

fv(xv)  >  /(/)  —  e  for  all  v  >  ve. 


In  view  of  the  continuity  of  /  and  the  fact  that  e  is  arbitrary,  we  conclude  that  Definition  3.1  (i)  is 
satisfied.  Next,  let  x  €  C.  There  exists  a  sequence  {xu}u£]y  such  that  xu  €  Cv  tends  to  x  sufficiently 
slowly  such  that  tv  X^j=i  h)g[— fj(xl>)\  0-  Consequently,  fu(xu)  — >•  /(x),  which  satisfies  Definition 
3.1(h).  Thus,  fu:Cu^M  epi-converge  to  f  :  C  1R.  We  next  analyze  optimality  functions.  Using  a 
minmax  theorem,  one  can  show  that  (2)  is  equivalently  expressed  as 


q  1 
6>(x)  =  -  min  ^  Mo^+(®)  +  ^  HiVl>+{x)  ~  fj(x)]  +  - 


3= i 


Mo  V/(x)  +  ^MjV/j(x) 
3=1 


,xel  =  r 


(4) 

where  M  =  {(no,  Mi,  •••,  Mg)  I  Mj  —  0,j  =  0, 1, ...,  g,  =  1};  see  [12,  Theorem  2.2.8].  By  (1)  and 

direct  differentiation  of  Z1',  we  obtain  an  approximating  optimality  function 


0"(x)  =  — 
v  '  2 


V/(x)  +  ^mJ(x)V/j(x) 

3=1 


where 


rrij(x)  = 


/jW 


Suppose  that  xv  — >•  x  €  J?," ,  with  xu  £  Cu.  Since  xu  £  Cu  C  C  and  C1  is  closed,  x  €  C.  Let 


c 


V 


and  jXj 


7Tlj(xU) 

cT 


,3  =  1, -,9- 


Consequently,  /U  =  (//q,  ...,  //() )  €  M  for  all  i/.  Since  M  is  compact,  {/U}  has  at  least  one  convergent 

subsequence.  Suppose  that  jiu  —tN  fi 00 ,  with  N  an  infinite  subsequence  of  IN.  If  j  is  such  that  fj(x)  <  0, 


then  fi'-  -$N  0  and  consequently  =  0  necessarily.  In  view  of  the  continuity  of  the  gradients,  we  then 
have  that 


1  JL  muj(xu) 

v/OO  +  E  h, 

2 

mS°v/(x)  +  X>°°v/,0 e) 

3=1 

3=1 

Since  x  €  C,  ip+(x)  =  0.  Therefore  we  also  have  that 


9u{xu) 

“FF 


.N 


<? 


-/*oV+(®)  -  fj (x)] 

3= 1 


<  0(x), 


where  the  inequality  follows  from  the  fact  that  /i°°  €  M  furnishes  a  possibly  suboptimal  solution  in 
(4).  Because  6v(xu)  <  0  and  {cv)2  >  1,  the  inequality  remains  valid  when  we  drop  the  denominator  on 
the  left-hand  side.  Hence,  we  have  shown  that  lirnsup  9u(xu)  <  0(x).  This  establishes  the  consistency 
of  {/"  :  Cu  — >•  M,  6V  :  Cv  — >•  M-)}.  Consequently,  the  above  algorithm,  which  can  then  be  viewed  as 
a  primal  interior  point  method,  generates  cluster  points  that  are  stationary  for  /  :  C  —>  M  in  the  sense 
of  Fritz-John.  We  observe  that  this  is  achieved  without  any  constraint  qualifications  and  convexity  as¬ 
sumptions.  In  this  case,  Step  2  of  the  algorithm  can  be  achieved  by  any  of  the  standard  unconstrained 
optimization  methods  in  finite  time. 


The  key  technical  challenge  associate  with  the  above  scheme  is  to  establish  (weak)  consistency.  In 
the  next  section,  we  provide  tools  for  this  purpose  that  rely  on  lopsided  convergence. 


4  Lopsided  Convergence 

In  view  of  the  definition  of  optimality  functions,  it  is  apparent  that 

if  Qe  /  0,  then  Qe  =  argmaxx  0. 

Moreover,  Examples  1-3  indicate  that  many  optimality  functions  take  the  form 

9{x)  =  inf  F(x,y),  with  Y  cy  (5) 

y&Y 

for  some  metric  space  (T-  py)  and  function  F.  In  fact,  in  our  examples,  y  =  JRn  and  F  involves 
gradients  and  other  quantities;  §5  provides  an  example  in  infinite  dimensions.  From  these  observations 
it  is  apparent  that  the  consideration  of  maxinf-problems  of  the  form 

max  inf  F(x,  y) 
xex  yeY  v 

for  bifunction  F  :  X  xY  — >•  M  will  provide  direct  insight  about  (quasi-)stationary  points  of  optimization 
problems.  We  therefore  set  out  to  describe  the  fundamental  tool  for  examining  the  convergence  of  such 
maxinf-problems,  which  is  lopsided  convergence.  In  the  process,  we  extend  some  of  the  results  in  [6,  7] 
to  general  metric  spaces. 
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Suppose  that  (X,px)  and  ( Y,py )  are  metric  spaces,  X  C  X  and  Y  C  y  are  nonempty,  and 
F  :  X  x  Y  —>  R  is  a  bifunction.  We  say  that  x*  is  a  maxinf-point  of  F  if 

x*  €  argmax  <  inf  F(x,  y ) 
xex  Uev 

The  study  of  such  functions  is  facilitated  by  the  notion  of  lopsided  convergence  as  defined  next. 

4.1  Definition  (lopsided  convergence)  The  bifunctions  {Fu  :  Xu  x  Yu  — >•  R}v^jv  lop-converge  to 
F  :  X  xY  -t  R.  if 

(i)  for  all  y  G  Y  and  xu  — >  x  G  X,  with  xv  G  Xv ,  there  exists  yu  — >•  y,  with  yv  G  Yu ,  such  that 
limsup  Fv{xu,yv)  <  F(x,y)  if  x  G  X  and  Fl'(xl',yl>)  — >•  — oo  otherwise. 

(ii)  for  all  x  £  X,  there  exists  xu  — >•  x,  with  xv  G  X",  such  that  for  all  yu  — >•  y  G  T ,  with  yu  G  W, 
liminf  Fv{xv,  yv)  >  F(x,y)  if  y  €  T  and  Fv{xv,  yv)  — >•  oo  otherwise. 

We  assume  throughout  that  the  sets  X17  C  and  C  T  are  nonempty.  We  start  with  a  preliminary 
result. 

4.2  Proposition  (epi-convergence  of  slices)  Suppose  that  the  bifunctions  {Fu  :  Xu  x  Yu  — >  R}v^ 
lop-converge  to  F  :  X  x  Y  M.  Then,  for  all  x  €  X,  there  exists  xv  — >•  x,  with  xv  €  such  that  the 
functions  Fv(xv ,  •)  epi-converge  to  F(x,  •). 

Proof.  We  follow  the  same  arguments  as  in  [6,  Proposition  3.2],  where  X  =  Rn  is  considered.  From 
Definition  4.1(ii)  there  exists  xu  — >•  x,  with  xu  €  Xv ,  such  that  the  functions  {Fv(xv,  -)}^eiV  and  F{ x,  •) 
satisfy  Definition  3.1  (i) .  From  Definition  4. 1  (i) ,  for  any  y  £  Y  and  xv  — >•  x,  with  xu  €  Xv ,  one  can  find 
yu  — >•  y,  with  yv  €  y1',  such  that  Definition  3.1(h)  is  also  satisfied.  □ 

We  recall  that  the  inf-projections  of  the  bifunctions  Fu  :  Xu  x  Yu  — >  R  and  F  :  X  x  Y  R  are 
defined  as  the  functions 

h(x)  =  inf  F(x,  y),  for  i£l,  and  hv (x)  =  inf  Fu(x,y),  for  i£l. 
y&  yeY v 

In  addition  to  their  overall  interest,  inf-projections  of  bifunctions  are  central  to  the  study  of  optimality 
functions  as  clearly  highlighted  by  (5). 

4.3  Theorem  (containment  of  inf-projections)  Suppose  that  the  bifunctions  {Fu  :  Xv  x  Yu  — >•  R}u^]y 
lop-converge  to  F  :  X  X  Y  — >  R  and  — oo  <  infy  F(x,  •)  for  some  x  €  X.  Then,  the  inf-projections 
hv  :  Xu  — >•  [— oo,  oo)  and  h  :  X  — >  [— oo,  oo)  satisfy 

limsup  hypo  hu  C  hypo  h. 

Proof.  Suppose  that  (x,xq)  €  limsup  hypo  hu .  Then  there  exists  a  sequence  {{xv ,Xq)}v&n-,  with  N 
an  infinite  subsequence  of  IN,  xv  €  Xu,  hu{xv)  >  Xq,  xv  ~^n  x,  and  Xq  ->-n  xq.  If  x  qL  X,  then  take 
y  €  Y  and  construct  a  sequence  yu  — >•  y,  with  yu  G  Yv ,  such  that  Fu(xu,yu)  N  —  oo,  which  exists  by 
Definition  4. 1  (i) .  However, 

xv0  <  hu{xv)  <  Fu(xu,yu),  neN, 
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imply  to  a  contradiction  since  Xq  —>n  xq  G  1R.  Thus,  x  G  X.  If  h(x)  =  — oo,  then  there  exists  y  G  Y 
such  that  F(x,  y)  <  x o  —  1.  Definition  4.1  (i)  ensures  that  there  exists  a  sequence  yv  — *  y,  with  yu  €  Y’1', 
such  that  limsup  Fu(xv,  yv)  <  F(x,y).  Consequently, 

x0  =  limsup^jv  Xq  <  limsup^jy  hv(xv)  <  limsup„eJV  Fv(xv,yv)  <  F(x,  y)  <  xq  -  1, 

which  is  a  contradiction.  Hence,  it  suffices  to  consider  the  case  with  h(x)  finite.  Given  any  e  >  0 
arbitrarily  small,  pick  ys  G  Y  such  that  F(x,ye)  —  £  <  h(x).  Then  Definition  4.1  (i)  again  yields 
yv  — >  y£,  with  yu  G  Yu ,  such  that 

limsup^gjy  hu(xu)  <  limsupye N  Fu(xu,  yv )  <  F(x,  y£)  <  h(x)  +  e, 

implying  limsup^gjv  hv{xv')  <  h(x).  Since 

xo  =  limsup^gjv  Xq  <  limsup^gjv  hu(xu)  <  h(x), 

the  conclusion  follows.  □ 

Additional  results  can  be  obtained  under  a  strengthening  of  the  lopsided  convergence  analogous  to 
tight  epi-convergence. 

4.4  Definition  (ancillary-tight  lop-convergence)  The  lop-convergence  of  bifunctions  {Fu  :  Xv  x  Yu  — » 

to  F  :  X  x  Y  — >■  ]R  is  ancilliary-tight  if  Definition  4.1  holds  and  for  any  £  >  0  one  can  find  a 
compact  set  Be  C  y,  depending  possibly  on  the  sequence  xv  — >•  x  selected  in  Definition  4.1(H),  such 
that 

inf  Fv (xv ,  y)  <  inf  Fu(xl',y)  +  e  for  sufficiently  large  v. 

yeY-'nBe  y  ;  ygY"  V  ’  6 

Under  ancillary-tight  lop-convergence,  we  can  strengthen  the  conclusion  of  Theorem  4.3  as  follows. 

4.5  Theorem  (hypo-convergence  of  inf- projections)  Suppose  that  the  bifunctions  {Fu  :  Xv  x  Yv  — » 

lop-converge  ancillary-tightly  to  F  :  X  X  Y  — >•  1R  and  —oo  <  inf y  F(x,-)  for  some  x  G  X. 
Then,  the  corresponding  inf-projections  hv  :  Xv  — >•  [— oo,  oo)  hypo-converges  to  the  inf-projection 
h  :  X  — »•  [—00,00),  i.e.,  hypo hv  set-converges  to  hypo h. 

Proof.  Since  it  is  a  very  short  proof,  we  include  it  for  completeness  sake.  It  is  verbatim  the  same  as 
that  of  [6,  Theorem  3.4].  Let  iGlbe  such  that  h(x)  is  finite.  Now,  choose  xv  — >•  x,  with  xv  G  Xv, 
such  that  Fu(xu,  •)  epi-converge  to  F(x,  •),  cf.  Proposition  4.2.  In  fact,  they  epi-converge  tightly  as  an 
immediate  consequence  of  ancillary-tightness.  Thus, 

hu(: xv)  =  inf  Fu(xu,yu)  — >•  inf  F(x,y)  =  h(x), 

ygYv  yeY 

via  Theorem  3.4.  □ 

Further  strengthening  of  the  notion  is  also  beneficial. 

4.6  Definition  (tight  lop-convergence)  The  lop-convergence  of  bifunctions  {Fu  :  Xv  x  Yv  — >• 

to  F  :  X  xY  — >•  1R  is  tight  if  Definition  4.4  holds  and  for  any  e  >  0  one  can  End  a  compact  set  Ae  (Z  X 
such  that 

sup  inf  Fu(x,y)>  sup  inf  Fu (x,y)  —  e  for  sufficiently  large  u. 
xex-'n Ae  y^YV  xeX"  yYYv 
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Under  tight  lop-convergence,  we  can  strengthen  the  conclusion  of  Theorem  4.5  as  follows. 

4.7  Theorem  (approximating  maxinf-points)  Suppose  that  the  bifunctions  {Fu  :  Xv  x  Yv  — >•  ]R}U£n 
lop-converge  tightly  to  F  :  X  x  Y  — >•  1R  and  sup^  infy  F  is  finite.  Then, 

sup  inf  Fv{x,y )  — >•  sup  inf  F(x,y). 
xeX"  yTYv  xcx  y^Y 

Moreover,  for  every  x*  £  argmaxxg^  inf y&y  F(x,y),  there  exist  an  infinite  subsequence  N  of  IN, 
{e^l^gAr,  with  £v\0,  and  {xu}u&n,  with  xv  £  argmax^g^  infyey^  Fu(x,  y),  such  that  xv  —A  x. 
Conversely,  if  such  sequences  exists,  then  sup^g^  infj/eY"  Fv(x,  y)  -A  inf^y  F(x*,  •) 

Proof.  We  refer  to  the  arguments  of  the  proof  of  [6,  Theorem  3.7].  □ 


5  Applications  and  Further  Examples 

We  now  return  to  the  context  of  optimality  functions  of  the  form  (5).  We  start  with  providing  a 
sufficient  condition  for  the  required  upper  semicontinuity  of  an  optimality  function;  see  Definition  2.1. 
We  state  the  result  for  general  inf-projections. 

5.1  Theorem  (upper  semicontinuity  of  inf-projection)  If  the  bifunction  F  :  X  X  Y  — >•  1R  is  lower 
semicontinuous,  F(-,y)  is  upper  semicontinuous  for  all  y  £  Y ,  and  Y  is  closed,  then  the  corresponding 
inf-projection  h(x)  =  infy  F{x,  -),x  £  X,  is  upper  semicontinuous. 

Proof.  Let  xv  — >  x  £  X,  with  xv  £  X.  We  show  that  the  functions  F(xu,  •)  epi-converge  to  F(x,-), 
which  are  all  defined  on  Y.  Suppose  that  yu  — >•  y,  with  yu  £  Y.  By  the  closedness  of  Y,  y  £  Y. 
Moreover,  liminf  F(xu,  yv)  >  F(x,y)  by  the  lower  semicontinuity  of  F  at  (x,y).  Consequently,  part  (i) 
of  Definition  3.1  is  satisfied.  Next,  let  yu  =  y  £  Y.  Then,  lirnsup  F(xv ,  yu)  =  limsupF(.xI/,  y)  <  F(x,y) 
by  the  upper  semicontinuity  of  F(-,y )  and  part  (ii)  of  Definition  3.1  is  satisfied.  Since  F(xu,-)  epi- 
converge  to  F{x ,  •),  Theorem  3.2  demonstrates  that  lirnsup  h(xu)  <  h(x).  □ 


Applications  of  this  theorem  to  Examples  1-3  establish  the  upper  semicontinuity  of  the  corresponding 
optimality  functions. 

We  next  turn  to  the  requirement  for  consistency  in  Definition  3.5(h). 

5.2  Theorem  (sufficient  condition  for  consistency,  optimality  function  part)  Suppose  that  the  bifunc¬ 
tions  {Fu  :  Xv  x  Yv  — >•  -KFgAr  lop-converge  to  F  :  X  x  Y  —>  M  and  that  the  bifunctions  dehne  the 
optimality  functions  9U  =  inf ygYv  Fv(-,y)  and  6  =  inf vey  F(-,y),  with  — oo  <  9(x)  for  some  x  £  X. 
Then, 

for  every  xu  — >•  x  £  X ,  with  xu  £  Xv,  lirnsup  9u(xu)  <  9(x)  if  x  £  X,  and  9u(xu)  — >•  — oo  otherwise. 

Proof.  The  result  is  a  direct  consequence  of  Theorem  4.3.  □ 

In  view  of  this  result,  it  is  clear  that  (weak)  consistency  will  be  ensured  by  epi-convergence  of  the 
approximating  objective  functions  and  feasible  sets  as  well  as  lopsided  convergence  of  the  approximating 
bifunctions  defining  the  corresponding  optimality  functions. 
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We  illustrate  Theorem  5.2  in  the  context  of  Example  3. 


Example  3:  Minimax  Problem  (cont.).  Suppose  that  for  every  z  £  Z,  there  exists  a  sequence 
zv  £  Zv  such  that  zv  — >•  z.  Let 

Fu{x,y)  =  m|x  j </>(£,  z)  -  fv(x)  +  (Vx<p(x,z),y  -  x)  +  ^||y  -  .t||2  j  ,  x,y  £  Rn, 

and  F  be  defined  similarly  with  the  superscripts  removed.  We  next  show  lopsided  convergence  of  Fv  to 
F.  First  consider  part  (i)  of  Definition  4.1.  Let  y  £  R."  and  xv  — >•  x  €  Mn.  Set  yv  =  y  for  all  v.  Clearly, 
limsup  yv)  <  limsup  F(xu,  y)  =  F(x,y)  by  the  continuity  of  F  and  part  (i)  holds.  Second,  we 

consider  part  (ii).  Let  x  £  Rn  and  yu  — >•  y  £  Rn .  Set  xv  =  x  for  all  v.  Let 

^  £  argma xzeZI<p(x,z)  -  f(x)  +  (Vx<p(x,  z),  y  -  x)  +  \\\y  -  x||2  j  . 


Let  e  >  0.  By  assumption  on  Zv  and  the  continuity  of  <p(x,  •)  and  Vx<p(x,  •),  there  exists  zv  £  Zv  and 
uq  such  that 


V{x,zv)  -  <p(x,zx)  >  -e 


| Vx(p(x,zu)  -  Vxcp(x,zx)\\  <  min  e, 


\\y-x\ 


for  all  v  >  vq.  Consequently,  v  >  vq, 


Fv{^,^)  =  Fv{x,yv) 

=  max  |  <p(x,z)  -  fu{x)  +  (S7xip(x,  z),yu  -  x)  +  \  \\yv  -  .x||2l 

z£Zv  y  Z  J 

>  <p(x,  zv)  -  f(x)  +  (Vx<p(x,  zv),yv  -x)  +  7,  II yv  -  x\\^ 

=  <p(x,zx)  -  f(x)  +  {vxip{x,zx),y  -  x)  +  ^|| y  -  x\\2  +  ip(x,zu)  -  <p(x,zx) 

+  ('Vx<p(x,zv)  -  Vxip(x,zx),y  -  x)  +  (Vx<p(z :,zu),yL'  -  y)  +  ^|| yv  -  .x||2  -  ^||y  -  x\\2 

>  <f{x,zx)  -  f(x)  +  (Vx(p(x,zx),y  -  x)  +  *||y  -  x\\2 
-£-£+  (Vx<p(x,zu),yu -y)  +  7^\\yu  ~  x\\2  - \h-x\\2 

=  F(x,  y)-2e  +  (Vxtp{x,  zv),yv  -  y)  +  ^||  yu  -  x||2  -\\\y-  a?||2 

Since  yu  — >•  y,  {zu}  is  bounded,  and  Vxy?  is  continuous,  it  follows  that  liminf  Fv{xl> ,  yu)  >  F(x,  y)  —  2e. 
Since  e  was  arbitrary,  part  (ii)  of  Definition  4.1  holds  and  Fu  therefore  lop-converge  to  F.  In  view  of 
Theorem  5.2  and  the  fact  that  epi-convergence  is  also  easily  established,  we  have  that  {( fu  :  Rn  — >• 
R,6P  :  Mn  — >  R-)}  are  consistent  approximations  of  {(/  :  Rn  —$■  R,  6  :  Rn  — >  R-)}  in  this  case. 
The  above  algorithm  therefore  is  implementable  for  the  solution  of  the  semiinfinite  minimax  problem 
minxeKn  max2gZ  ip(x,  z). 
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As  we  see  next,  under  slightly  stronger  assumptions,  the  approximating  bifunctions  do  not  need  to 
be  associated  with  an  optimality  function  to  achieve  convergence  to  quasi-stationary  points. 

5.3  Theorem  (convergence  to  quasi-stationary  points)  Suppose  that  the  bifunctions  {Fu  :  Xv  x  Yv  — » 
1R}v<zn  lop-converge  ancillary-tightly  to  F  :  X  xY  1R  and  6  =  infyey  F(-,y )  is  an  optimality  function 
for  f  :  C  — ^  M  with  Qg  /  0.  If  xu  €  argmaxa.gy„  \niy&Yv  Fv(x,  y )  for  all  u,  then  every  cluster  point  x 
of  {xu}U£n  is  quasi-stationary  for  f  :  C  1R,  i.e.,  x  €  Qg. 

Proof.  By  Theorem  4.5,  hv  =  inf y^Yv  Fv {■  ,y)  hypo-converge  to  h  =  inf^gy  F(-,  y).  By  translating 
Theorem  3.2  from  the  stated  minimization  framework  to  a  maximization  framework  we  obtain  that 
x  €  argmaxA-  h.  Since  Qg  /  0,  Qg  =  arginax^  h  and  the  conclusion  follows.  □ 

Further  characterization  of  (quasi- (stationary  points  is  available  under  tight  lopsided  convergence. 

5.4  Theorem  (characterization  of  quasi-stationary  points)  Suppose  that  the  bifunctions  {Fu  :  Xu  x 
Yv  — >•  IFL\V  giv  lop-converge  tightly  to  F  :  X  xY  M  and  6  =  inf^gy  F(-,y)  is  an  optimality  function 
for  f  :  C  — >•  M  with  Qg  /  0.  For  every  x  €  Qg  there  exist  an  infinite  subsequence  N  of  IN,  {su}u£]y, 
with  eu\0,  and  {xu}u&n^  with  xu  €  eu- argmaxxeX^  inf^gy^  Fu(x,y),  such  that  xu  —> N  x. 

Proof.  The  result  is  a  direct  consequence  of  Theorem  4.7.  □ 

We  end  the  paper  with  an  example  from  the  area  of  optimal  control  and  adjust  the  notation  ac¬ 
cordingly. 

Example  4:  Optimal  Control.  We  here  follow  the  set-up  in  Section  5.6  and  Chapter  4  of  [12],  which 
contain  further  details.  For  g  :  Mn  x  Mm  lRn,  we  consider  the  dynamical  system 

x(t)  =  g(x(t),u(t)),  for  t  €  [0, 1],  with  x(0)  =  £  €  IRn, 

where  the  control  u  €  =  {it  :  [0, 1]  -»  lRm  \  measurable,  essentially  bounded}.  Since  such  controls 

are  contained  in  the  space  of  square-integrable  functions  from  [0, 1]  to  Mm,  the  usual  L2-norm  applies; 
see  [12,  p.709]  for  a  motivation  for  this  “hybrid”  set-up.  Let  H  =  IRn  x  £™.  For  initial  condition  and 
control  pairs  g  =  (£,  it)  €  H  and  fj  =  (^,  u)  €  H,  we  equip  H  with  the  inner  product  and  norm 

(V,V)H  =  (^0+  f  (u(t),u(t))dt  and  \\g\\n  =  {g,g)n- 
Jo 

We  consider  control  constraints  of  the  form  u(t)  €  C,  for  almost  every  t  €  [0, 1]  for  some  given  convex 
and  compact  set  C  C  Mm .  By  imposing  the  constraints  for  almost  every  t  instead  of  every  t,  we  deviate 
slightly  from  [12]  and  follow  [9].  We  therefore  also  define  the  feasible  set 

U  =  fl  {u  |  u(t)  €  C,  for  almost  every  t  £  [0, 1]}  and  H  =  Mn  x  U. 

Under  standard  assumptions,  a  solution  of  the  differential  equation,  for  a  given  g  €  H,  denoted  by  xv  is 
unique,  Lipschitz  continuous,  and  Gateaux  differentiable  in  g.  Consequently,  for  a  given  <p  :  lRn  x  Mn  — > 
1R,  Lipschitz  continuously  differentiable  on  bounded  sets,  the  function  /  :  H  IR  defined  by 

f(v)  =  <P(€,Xv(  1))’  for  J  €  H > 
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has  a  Gateaux  differential  of  the  form  (V/(r/),?y  —  77)%  for  some  Lipschitz  continuous  gradient  V/(?y) 
given  in  [12,  Corollary  5.6.9].  The  optimal  control  problem 

minimize  /(r?)  subject  to  77  £  H, 

analogous  to  Example  1,  has  an  optimality  function 

9{if)  =  min  for  77  £  H, 

r/£H 

where 

F(V,V)  =  (^f(v),V-v)n  +  ^\\V~V\\ u,  for  77,77  £  H. 

We  next  consider  approximations.  Let  Uv  C  U,  v  £  IV,  consist  of  the  piecewise  constant  functions 
that  are  constant  on  each  of  the  intervals  [(k  —  l)/v,  &/zz),  k  =  1, ...,  v.  Set  Hv  =  Ft'1  x  [/U  Moreover, 
let  x"  be  the  (unique)  solution  of  the  forward  Euler  approximation  of  the  differential  equation,  using 
time-step  1/v,  given  input  77  =  (£,it)  €  7t.  An  approximate  problem  then  takes  the  form 

minimize  fu(r 7)  subject  to  77  €  Hv , 


where 

f(v)  =  <(!))• 

One  can  show  that 

9l'{rf)  =  min  Fu(rj,fj),  for  rj  £  Hu , 

where 

Fu(r],f fj)  =  (V fl'(r]),fj  —  rfj'n  +  ^||f7  —  ??||^,  for  77,  fj  £  Hu, 

is  an  optimality  function  of  fv  :  H1'  — >•  Ft,  where  the  Lipschitz  continuous  gradient  V  fu(rj)  is  given  in 
[12,  Theorem  5.6.19]. 

By  [12,  Theorem  4.3.2],  for  every  bounded  set  S  C  H,  there  exists  a  Cs  <  00  such  that  1/(77)  — 
fu(r])\  <  Cs/v  and  ||V/(?7)  —  'Vfu{r])\\n  <  Cs/v  for  all  77  €  S.  Moreover,  U \u&nHv  is  dense  in  H. 
Consequently,  it  is  easily  established  that  fv  :  Hu  — >•  Ft  epi-converge  to  /  :  H  — >•  Ft.  We  next  consider 
the  optimality  functions.  Let  fj  £  H  and  77^  — >  77  €  T~L,  with  r/u  £  Hu .  Necessarily,  77  €  H.  Due  to  the 
density  result,  there  exists  fju  — *  fj,  with  fju  £  Hu.  Hence, 

\F"(,f,rf)  -  F(v,fj)  1  <  ||  vr(V')  -  v/(,)||„||f," 

+  ir v/wn«nr  -rf-ri+ .,n«  +  \  11  if  -  „ii,  -  inr  -  <ni«  -+  0 

and  we  have  shown  Definition  4.1  (i) .  Using  similar  arguments,  we  also  establish  part  (ii)  and  the 
lopsided  convergence  of  Fu  to  F.  Consequently,  {(/"  :  Hv  —f  Ft,  9U  :  Hu  — >•  Ft_)}ue]\r  are  consistent 
approximations  of  (/  :  H  — >  Ft,  9  :  H  —f  Ft _).  Since  the  minimization  of  fv  :  Hv  —f  Ft  is  equivalent  to 
an  optimization  problem  on  a  Euclidean  space,  the  above  algorithm  is  implementable  for  the  infinite¬ 
dimensional  problem  /  :  Ft  —f  Ft. 
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